Adversarial training (AT) is a powerful regularization method for neural networks, aiming to achieve robustness to input perturbations. Yet, the specific effects of the robustness obtained by AT are still unclear in the context of natural language processing. In this paper, we propose and analyze a neural POS tagging model that exploits AT. In our experiments on the Penn Treebank WSJ corpus and the Universal Dependencies (UD) dataset (28 languages), we find that AT not only improves the overall tagging accuracy, but also 1) largely prevents overfitting in low-resource languages and 2) boosts tagging accuracy for rare/unseen words. The proposed POS tagger achieves state-of-the-art performance on nearly all of the languages in UD v1.2. We also demonstrate that 3) the improved tagging performance by AT contributes to the downstream task of dependency parsing, and that 4) AT helps the model to learn cleaner word and internal representations. These positive results motivate further use of AT for natural language tasks.
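To make the core idea concrete, below is a minimal sketch of AT on word embeddings in the spirit of Miyato et al. (2017): the gradient of the clean loss with respect to the embeddings gives a worst-case (first-order) perturbation direction, and the model is trained on the clean loss plus the loss under that perturbation. The toy BiLSTM tagger, all names, and hyperparameters here are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyTagger(nn.Module):
    """A tiny BiLSTM POS tagger, used only to demonstrate the AT loss."""
    def __init__(self, vocab_size, num_tags, emb_dim=64, hidden=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, bidirectional=True, batch_first=True)
        self.out = nn.Linear(2 * hidden, num_tags)

    def forward_from_embeddings(self, emb):
        h, _ = self.lstm(emb)
        return self.out(h)  # (batch, seq_len, num_tags)

def adversarial_loss(model, word_ids, tags, epsilon=0.05):
    """Clean loss + loss under a linearized worst-case embedding perturbation."""
    emb = model.embed(word_ids)
    logits = model.forward_from_embeddings(emb)
    # cross_entropy expects class scores in dim 1, hence the transpose
    clean = F.cross_entropy(logits.transpose(1, 2), tags)

    # Gradient of the clean loss w.r.t. the embeddings is the direction in
    # which the loss increases fastest (first-order approximation).
    grad = torch.autograd.grad(clean, emb, retain_graph=True)[0]
    # r_adv = epsilon * g / ||g||_2 (a single global norm; the paper's setup
    # may normalize differently, e.g. per sentence)
    r_adv = epsilon * grad / (grad.norm() + 1e-12)

    # detach() so no gradient flows through the perturbation itself
    adv_logits = model.forward_from_embeddings(emb + r_adv.detach())
    adv = F.cross_entropy(adv_logits.transpose(1, 2), tags)
    return clean + adv

# Usage with dummy data: 8 sentences of length 12
model = ToyTagger(vocab_size=100, num_tags=17)
opt = torch.optim.Adam(model.parameters())
words = torch.randint(0, 100, (8, 12))
tags = torch.randint(0, 17, (8, 12))
loss = adversarial_loss(model, words, tags)
opt.zero_grad()
loss.backward()
opt.step()
```

Note that the perturbation is applied in embedding space rather than to the discrete tokens, which is what makes AT applicable to text at all; the added loss term acts as the regularizer whose effects the paper analyzes.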